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ABSTRACT 



We use a Minimum Spanning Tree algorithm to characterize the spatial distribution of Galac- 
tic Far-IR sources and derive their clustering properties. We aim to reveal the spatial imprint 
of different types of star forming processes, e.g. isolated spontaneous fragmentation of dense 
molecular clouds, or events of triggered star formation around Hn regions, and highlight global 
properties of star formation in the Galaxy. We plan to exploit the entire Hi-GAL survey of the 
inner Galactic plane to gather significant statistics on the clustering properties of star forming 
regions, and to look for possible correlations with source properties such as mass, temperature or 
evolutionary stage. In this paper we present a pilot study based on the two 2°x2° fields centered 
at longitudes I = 30° and I = 59° obtained during the Science Demonstration Phase (SDP) of 
the Herschel mission. We find that over half of the clustered sources are associated with Hn 
regions and infrared dark clouds. Our analysis also reveals a smooth chromatic evolution of the 
spatial distribution where sources detected at short- wavelengths, likely proto-stars surrounded 
by warm circumstellar material emitting in the far-infrared, tend to be clustered in dense and 
compact groups around Hn regions while sources detected at long- wavelengths, presumably cold 
and dusty density enhancements of the ISM emitting in the sub-millimeter, are distributed in 
larger and looser groups. 



Subject headings: 
millimeter: ISM 



Stars: formation - Stars: protostars - ISM: HII regions - Submillimeter: stars - Sub- 



1. Introduction 

Observers usually rely on fitting Spectral En- 
ergy Distributions (SEDs), or other distinctive 
spectral features, with theoretical models to probe 
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the physical properties of Young Stellar Objects 
(YSOs), and p ossibly learn about star formation 
processes (e.g., Shu et al. 1987 : Genzel fc Stutzkil 



119891: lAndre et al.lll993l l2000l: lEvandll999h . Like- 
wise, the spatial distribution of YSOs contains 
valuable information about star formation, in par- 
ticular the imprint of gravitational fragmentation 



ticular tnc imprint ot gravitational fragmentation 
in molecular cloud s (Gome z et al ]l!993t iHartmannl 



2002t lAllen et all 120071; ISchmeia et all 120081 ). or 



mass (Kirk & Mverd 


2010) or evolutionarv 


(Gutermuth et all 


20091 Carlson et al. 2010h 



analysis of clustering properties is in fact comple- 
mentary to the spectral approach in the sense that 
it relates to an entire population of objects rather 
than individual sources. This however calls for a 
very large sample of YSOs. In addition, to probe 
the initial spatial distribution of forming stars, or 
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rather star clusters (ILada fc Ladall2003l ). it is best 
to observe the youngest population of YSOs be- 
fore they diffuse away from their stellar nursery 
while the gas evaporates from the disrupting clo ud 
(jBastian et alj|2008t IProszkow fc Adamsll2009t) . 

We plan to use the H erschel Space Observa- 
tory (|Pilbratt et al.ll20ld ). operating in the far- 
Infrarcd/sub- millimeter regime and covering the 
peak emission of the youngest YSOs, to study 
their clustering properties and tentatively relate 
the observed spatial distribution with different 
mechanisms of star formation. We will also search 
for correlations between clustering and YSO phys- 
ical properties - such as mass, temperature, evo- 
lutionary stage - as well as YSOs immediate envi- 
ronment - density, radiation field, Hn region. 

Herschel will observe over 270 square degrees 
of the inner Galactic plane as par t of the Hi- 



GAL survey (jMolinari et al.l l2010al ). With its 



unprecedented angular resolution, sensitivity and 
spatial coverage, we expect Hi-GAL to detect 
tens of thousands of sources. This will consti- 
tute a very large data set that should enable us 
to reach high statistical significance for identifying 
YSO clustering trends. In this article we present 
a pilot study of our project based on the two 
2°x2° fields centered in the Galactic plane at lon- 
gitudes I = 30° and I = 59° obtained during the 
Science Demonstration Phase (SDP) of the Her- 
schel mission. The scope of this article is therefore 
more modest than if using the entire survey due 
to the lower statistics available (only 3% of the 
survey was covered during the Herschel SDP), yet 
it presents the methodology and associated diag- 
nostic tools we have developed in preparation to 
exploiting the entire survey. 

In section [2] we present the observations of 
the two SDP fields and the catalog of extracted 
sources we use in our analysis. In section [3j wc 
describe our approach to characterize the spatial 
distribution of Hi-GAL sources using a Minimum 
Spanning Tree algorithm and exploiting the helio- 
centric distance informati on recently obtaine d by 
the Hi-GAL consortium ( Russeil et al. 20 111) We 
discuss our results in section [H in particular the 
wavelength dependence we find on the clustering 
properties of the YSOs, as well as the relation be- 
tween clustering and cloud fragmentation, or the 
presence of Hn regions. Finally we give our con- 
clusions and prospects for the entire Hi-GAL data 



set in section [5] 



2. Observations and Source Catalog 



Th e Herschel Space Observatory (jPilbratt et al 
2010l ) observed two 2.1 x 2.1 square degree fields 
as part of the Science Demonstration Phase of the 
mission in November 2009. These two target fields 
were chosen from the Her schel Infrared GALac - 
tic plane survey (Hi-GAL, iMolinari et alboiOal ). 
and are approximately located on the Galactic 
plane at longitudes / = 30° and I — 59°. Ob- 
servations wer e carried out in the SPIRE /PACS 
paral lel mode (|G riffin et al.|[2oTol iPoglitsch et al 



2010[ ) at fast scan speed (60"/s) in two orthogo 
nal directions. This observing strategy provides 
simultaneous imaging in five bands centered at 70, 
160, 250, 350, and 500 /j,m, and an angular resolu- 
tion varying from ^10" to 40". The cross-scanned 
observations are used to preserve the extended 
emission from the interstellar medium (ISM) dur- 
ing the map-making process. Maps are created 
with the ROMAGAL algorithm (Traficante et al., 
submitted), and th ree-color image s are pr esented 
in figure 1 and 2 of Molinari et al.l ( 2010al ) . 

The morphology of the I = 30° field in Her- 
schel bands is mostly shaped by two luminous 
massive star-form ing complexes, nam e ly the mini- 
starburst W43 (iMotte et all 120031 iBallv et al 



2010) located in the inner arm of our Galaxy at 
5.8 kpc from the Sun, and the ultr a compact Hn 



regio n G29.96-0.02 (hereaft er G29. iBeuther et al 
2007) located at -8 kpc. IPaladini et all (|2003l ) 



find an additional 23 Hn regions in this 4 square 
degree field. The I = 59° field i s dominated by th e 
large OB association VulOBl (jBillot et al.ll2010l ). 
located in the Sagittarius arm at a distance of 
2.3 kpc. 

Source detection at these wavelengths is a com- 
plex task due to the rich structure d backgrounds 



present in the Galactic plane. IMolinari et al 
(|2010br ) have developed a method, based on the 
second derivatives of the maps, that filters out 
low spatial frequencies along multiple directions 
and reveals compact sources that exhibit strong 
signal gradients in the image. The photometry 
is then measured by fitting multiple Gaussians 
to the detected groups of pixels that have a sec- 
ond derivative value above a given threshold (see 
Molinari et al.l for details). The source extrac- 
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tion is carried out independently at the five wave- 
lengths. Sources detected at 70, 160, 250, 350 
and 500 /im are the n band-merged, following the 
method described in Elia et al. ( 2010t ). to form the 
source catalog that we use as a starting point for 
the spatial distribution analysis presented in this 
article. Note that we will use the term single- 
wavelength source catalogs in section |4] when re- 
ferring to single columns of the complete band- 
merged catalog, i.e. all the sources detected from 
a single image at a particular wavelength. Ta- 
ble Q] gives the numbers of sources extracted in 
both fields and in each Herschel band individu- 
ally, as well as the total number of sources in the 
band-merged catalog. 

Furthermore, there has been a significant ef- 
fort within the Hi-GAL consortium to measure 
the distance to most sources detected in the SDP 
fields in order to derive physical parameters such 
as the mass or the luminosity from the mea- 
sured fluxes. Distances were estimated using a 
multi-wavelength approach, exploiting both spec- 
tral line emission (kinematic distances) as well 
as extinction maps, parallax measurements, and 
physical connection with objects at known dis- 
ta nces. Details of the methodology arc described 
in lRusseil et al. ( 2011 ). In total, over 2000 sources 
(>90%) possesses a distance estimate. 

3. Characterization of the Spatial Distri- 
bution 

Several mathematical tools are available to 
characterize the spatial distribution of a set of 
localized points, e.g. the two-point correlation 
function, the nearest-neighbor filtering, the min- 
imum spanning tree, the mean surface density of 
companion, the Voronoi tessellation, etc. All these 
methods have been successfully applied to astro- 
nomical data sets to detect YSO clusters in star 



form i ng regions (IGomez et al.l 119931: iH artmann 



2002tlKarr fe Martirj|2003tlCartwright et al.l | 200 



Schmeia fe Klessen 20061 : IChavarria et al.l I200£_ 
Gutermuth et al.l l2009t) The process of cluster 
identification always requires the determination 
of a threshold to isolate source overdensities from 
the underlying population of distributed objects. 
This threshold can take the form of a cutoff source 
surface density or a cutoff source separation, de- 
pending on the chosen approach, and it plays a 



crucial role in deriving cluster properties. Here 
the term cluster is used to designate source over- 
densities that contrast against the distribution of 
field objects. In the following analysis, the as- 
signment of cluster membership is solely based 
on morphological grounds, without any kinematic 
information. Consequently, the detected clusters 
might be gravitationally bound entities as well as 
loose associations simp ly tracing regions of star 
forma tion as defined by (deles fe Portegies Zwart 
(|201lh . 

In general the characterization of spatial dis- 
tributions remains vague and imprec ise. For in- 
stance , from a census of recent studies. iBressert et al 
( 2010f ) point out that the fraction of sources in 
clusters can vary from 40 to 90% depending on 
the adopt e d defi nition of a cluster. Furthermore, 
Schmeia I (|2010f ) has conducted a quantitative 
comparison of 4 different algorithms to identify 
star clusters in a field, and they all exhibit vari- 
able efficiencies depending on the size and charac- 
ter of the investigated area and the purpose of the 
study. 

In the present article, we follow the m ethodol- 
ogy presented in Gutermuth et al. ( 20091) to study 
the spatial distrib ution of the Hi-GAL data set. 
Gutermuth et all argue in favor of the Minimum 
Spanning Tree (MST) algorithm for several rea- 
sons: (1) the 'overdensity' threshold is derived in a 
systematic manner from the data itself, (2) it cre- 
ates fully connected entities rather than islands 
of isolated groups with few sources as with the 
nearest-neighbor approach, and (3) there is no in- 
herent smoothing associated with the MST anal- 
ysis so that there is no bias with regard to the 
shapes of the clusters one can isolate. 

3.1. The Control Distribution 

Throughout the remainder of our source clus- 
tering analysis, the spatial distribution of the ob- 
served sources is systematically compared to a 
control distribution that exhibits no clustering 
properties, apart from the confinement of sources 
within the Galactic plane. This control distribu- 
tion is used as a reference to help interpret our 
results. It has the same spatial coverage, and con- 
tains as many sources, as the observed fields. It 
is generated by drawing the Galactic longitude co- 
ordinate from a constant probability distribution, 
i.e. source aggregates are solely of statistical ori- 
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Table 1 

Number of entries in the source catalog per band and per field. 



Band 


70 fim 


160 /im 


250 fim 


350 


500 /im 


Total* 


I = 30° 


698 


679 


758 


785 


592 


1565 (1388) 


I = 59° 


336 


389 


675 


578 


515 


1113 (718) 



indicates the total number of entries in the band-merged catalog having 
at least one detection in one of the Herschel bands. The slanted number in 
parenthesis indicates the number of sources for which a distance estimate 
is available. 



gins, and the Galactic latitude coordinate from 
a Gaussian distribution in order to reproduce a 
repres entative source overdensity in the Galactic 
plane ( Ferrierd 2001 ) . Sources separated by less 
than 6" are removed from the control distribu- 
tion to account for the limited spatial resolution of 
the Herschel telescope at 70 /im. This represents 
about 0.5% of the total number of sources and this 
has a minimal impact on the shape of the distribu- 
tion. Figure [1] presents the spatial distribution of 
the sources detected in the I = 30° field at 70 ^m 
as well as the associated control distribution. 

3.2. The Distance Matrix 

The computation of the distance matrix is the 
initial commonality to most approaches aiming at 
characterizing the spatial distribution of a given 
data set. Given a set of N sources of known po- 
sitions, the element (i,j) of the distance matrix 
is the angular distance separating the i th and j th 
sources. The matrix dimension is therefore N x N, 
and it is symmetrical with zeros along its diago- 
nal. It contains information from the smallest to 
the largest scales. The inclusion of the heliocentric 
distance information is discussed in section f3. 41 

Figure [5] presents the histogram of the distance 
matrices derived from the two distributions of Fig- 
ure [TJ and it shows that our data set exhibits an 
excess of short spacings compared to the control 
distribution, which provides the first qualitative 
evidence for source clustering in the observed field. 
The bump on the solid black line at 6 ~ 0.75° rep- 
resents the average source spacing that exists be- 
tween the two star forming regions W43 and G29 
which host a large fraction of the detected sources. 



3.3. Minimum Spanning Tree and Cluster 
Identification 

A minimum spanning tree is 'grown' by con- 
necting a set of points by a network of lines, or 
branches, and by minimizing the total length of 
the branches while making no closed loops in the 
connections. There is a unique solution to this 
mathematical problem as long as each spacing 
in the distance matrix has a unique value. We 
use a custom IDL routine based on the Prim's 
algorithm to generate MSTs from the Hi-GAL 
source catalogs. The top panels of Figure [3] 
present the minimum spanning trees we have de- 
rived using all entries of the band-merged cat- 
alogs for the I = 30° and I = 59° fields, re- 
spectively. Source aggregates contrast well with 
the low source-density background, and we iden- 
tify groupings of objects by fol l owing the defini- 



tion given in iGutermuth et all (|2009h . We 



ploit the distribution of MST branch lengths to 
define a critical length: all the sources connected 
by branches shorter than the critical length are as- 
sumed to belong to a group, or a source overden- 
sity where sources are closely spaced, otherwise 
they are assumed to be isolated. The determi- 
nation of the critical length is somewhat arbitrary 
sinc e it is not based on physical assumptions, how- 
ever Gutermuth et all used simple test case mod- 



els to show that cluster-like structures can be iso- 
lated from a low-density source distribution when 
a critical length is derived by fitting the cumula- 
tive distribution function (CDF) of MST branch 
lengths with segments, and by choosing the criti- 
cal length as the intersection point of the linear fits 
to the lower and upper ends of the branch length 
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SDP 130 source distribution 



Reference source distribution 




Fig. 1. — Spatial distribution of all entries of the band merged catalog for the SDP I = 30 field, and the 
distribution of sources randomly distributed in a plane to serve as a reference, or control, distribution. 
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Fig. 2. — Histograms of reciprocal angular dis- 
tances derived from the two distributions pre- 
sented in Figure [TJ The histograms are normalized 
to the total number of sources in each distribution. 



scale. 

The bottom panels of Figure|3]show the CDF of 
the MST branch lengths derived for the observed 
fields as well as for the control random distribu- 
tion described in scction l3~2l The control distribu- 
tion CDF appears to be more symmetrical and to 
peak at longer branch lengths than the observed 
sources, which means that short MST branches 
are more numerous in the observed fields, i.e. that 
Hi- GAL sources are more clustered than randomly 
distributed sources. Note also that the minimum 
source spacing possible in the Hi-GAL fields is set 
by the spatial resolution of the telescope, while 
it is set manually in the control distribution, so 
that MST branch lengths can only populate the 
histograms down to 6". The observed CDF is 
well fitted by segments, and the critical length is 
found to be quite similar for both fields, ~80" and 
100" for the I = 30° and I = 59° fields, respec- 
tively. Nevertheless the shorter critical length for 
the I = 30° field can be attributed to the higher 
source density found around W43 and G29 com- 
pared to Vul OBI in the I = 59° field, which makes 
the CDF steeper at the shorter end of the branch 
lengths and thus displaces the intersection of the 
fitted segments toward shorter spacings. 

Bressert et al. (l2010h give a brief summary of 



the various methods recently used in cluster iden- 
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tification, and they point out that the derived frac- 
tion of clustered sources can vary substantially for 
different methods (from 40 to 90 %). In our case, 
we follow Gutermuth et al. (2009) and we define a 
cluster as being a group of sources connected by 
branches shorter than the critical length, and that 
contains more than 10 members. We find that 
~70% of the sources are associated to a cluster. 



Gutermuth et al.1 find very similar clustered frac- 
tions in nearby young star-forming clusters. The 
cluster identification method appears to be quite 
reliable for our data set as it picks up all the over- 
densities that can be recognized by eye, e.g. W43, 
G29, and Vul OBI for the largest associations, but 
also smaller substructures. The identified clusters 
are encircled by convex hulls (see section 14. ip in 
Figure [3] 

We have also tried the a lternative approach 
pr esented in Battinelli (|199ll ). and more recently 



Koenig et al.l ( 2006 ) , to compute the critical 



branch length by deriving the number of clusters 
found in a given data set as a function of the cut- 
off branch length. For instance, starting from a 
cutoff length equal to the shortest branch of the 
MST, we find a single group with 2 members, i.e. 
the two sources that exhibit the shortest spacing, 
and then as the cutoff length increases and loosens 
the constraint to detect clusters, more clusters are 
found. The number of clusters is then expected to 
decrease when the cutoff length increases further 
due to the coagulation of groups into very few large 
clusters until the cutoff length is large enough to 
find a single cluster encompassing all the sources 
in the field . This curve the refore reaches a max- 
imum, and iBattinellil ( 1991 ) argues that the cut- 
off length for which the maximum is reached is 
the critical length to consider for the MST as it 
accompli shes the requ i remen t of maximum infor- 
mation. iKoenig et al.1 (|2008l ) find a smooth bell- 
shaped function for the W5 region with a well de- 
fined peak. However the curve we derive for the 
Hi-GAL sources has multiple local maxima ren- 
dering the definition of a critical length rather un- 
certain. We thus conclude that this approach is 
not suitable for our data set, presumably due to 
the presence of separate star forming complexes 
in the field at various distances combined with 
the finite spati al resolution of the ob servatory as 
pointed out by iBastian et al. (<2007h . We there- 



considering the whole Hi-GAL survey. 

3.4. Including the distance information 

The distance e stimate, available fo r over 90% of 



Hi-GAL sources (jRusseil et al.l 1201 lh . is a crucial 



piece of information for our analysis of clustering 
properties. In particular, we exploit distance esti- 
mates to separate sources along the line of sight, 
and to convert angular distances in the sky into 
linear distances. 

Our initial approach was to consider all sources 
with a distance estimate, compute Euclidean dis- 
tances between those sources in 3-D space, and 
then create three-dimensional minimum spanning 
trees. This method would in theory be the most 
appropriate to recover source clustering proper- 
ties since it offers the best rejection of fortuitous 
associations due to projection effects. However, in 
practice, it fails to give satisfactory results because 
of the relatively large uncertainties associated with 
the estimated radial distances, ~0.6-0.9 kpc (Rus- 
seil et al., private communication), compared to 
the tangential source spacings. 

We therefore opted for a more pragmatic ap- 
proach exploiting the fact that Hi-GAL sources 
are mai nly found within th e spiral arms of the 
Galaxy (jRusseil et al.l 120111 ) to segregate sources 
per heliocentric distance bins, and derive indepen- 
dent MSTs and cluster properties for each bin. 
The bottom panels of Figure [4] show the distri- 
bution of heliocentric distances in each field. The 
histograms possess well defined peaks that trace 
source overdensitics located in the Galactic arms. 
The top panels present the 2-D MSTs and iden- 
tified clusters per distance bins. The major im- 
provement with this approach comes mainly for 
the I = 30 field, for which the line of sight 
crosses 3 spiral arms. Indeed we find neighbor- 
ing, and even overlapping, clusters that belong to 
different spiral arms and are actually several kilo- 
parsccs apart. Including the distance information 
thus allows a clear separation of the clusters, which 
was not possible when treating the problem in two 
dimensions as in Figure |3l 



fore expect these fluctuations to get worse when 
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MST and clusters in SDP 130 field 
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Branch cutoff length: 82 
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Fig. 3. — Top panels: Minimum Spanning Trees derived from the band merged catalog for the I = 30° (left) 
and I = 59° (right) fields. The total number of sources considered here is given at the bottom right of the 
plots. The light grey segments are the branches of the MST that connect all the detected sources (light 
grey asterisks) . Dark grey squares represent sources that are connected by branches shorter than the critical 
branch length. Clusters of sources containing more than 10 members are encircled by a convex hull made 
of black segments (see text for details). Bottom panels: Histograms of the branch lengths for the derived 
MSTs, as well as for the control distribution for comparison. The cumulative di stribution functions are fitted 
by straight lines of the lower and upper parts of the branch length scale (see Gutermuth et al. 20091) . and 
their intersection (vertical dotted-dash line on the plot) defines the critical branch length. 
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Clusters per distance bins in SDP 130 field 
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Clusters per distance bins in SDP I59 field 
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Fig. 4. — Minimum Spanning Trees derived per heliocentric distance bins. Bottom panels: Histograms of 
source heliocentric distances for the two SDP fields. Sources belong to distinct arms of the Galaxy and 
can therefore be segregated per distance bins (vertical dot-dash lines define the distance bins). Top panels: 
Clustering properties are extracted from MSTs derived independently per distance bin (distances are grey- 
scale coded). The symbol convention is as in Figure [3] 
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4. Results and Discussion 



4.1. Cluster Characterization 



Fol lowing the formalism of iGutermuth et al 
( 20091 ). we compute the morphological proper- 
ties of clusters identified per distance bin (cf sec- 
tion I3.4j) . Results are listed in Table [2] for each 
SDP field. 

The coordinates assigned to a cluster is the me- 
dian value of the individual cluster members coor- 
dinates. Similarly, the heliocentric distance to the 
cluster is derived as the average distance to indi- 
vidual cluster members. The associated standard 
deviation is also given in Table [5] and represents 
the source spread along the line of sight, it is thus 
a rough indicator of the depth of the cluster. 



The circular radius, R c 



is calculated as half 



of the largest distance between any two members 
of a cluster. In other words, it is the radius of 
the minimum area circle that encloses the entire 
grouping. In addition, to account for the non- 
circular geometry of most clusters, we compute 
an effective area, Ahull , by drawing a convex 
around each grouping, and by computing the as- 
sociated effective radius R huH = A adjusted /ir, 
where A adjusted = A huU /(l - n huU / / n tota i) is the 
adjusted effective area of the hull, rihuii is the 
number of sources located on the hull, and ntotai 
is the total number of sources in the cluster. Ac- 



cording to ISchmeia k, Klessenl (|2006f ) , the use of 
Aadjusted is more appropriate than Ahull because 
it has been slightly enlarged to account for all the 
sources located on the hull, i.e. vertices that are 
not strictly enclosed in the polygon. We also de- 
rive the quantity R 2 circ l ' R\ u u as an estimator of 
the cluster aspect ratio, and A ad j USted /ntotai as the 
mean surface density of source in the cluster. The 
last entry of Table [5] is the median MST branch 
length measured in the clusters, and converted 
from arcminutes to parsecs using the estimated 
cluster heliocentric distance. We discuss the rel- 
evance of this quantity to study the imprint of 
gravitational fragmentation in molecular clouds in 
section [ 



lr rhe convex hull is derived using the triangulate. pro IDL 
routine, which computes the minimum area polygon that 
contains a set of points such that all internal angles between 
adjacent edges are less than 180°. 



Figure [5] shows the distribution of some of the 
quantities presented in Table O We find that 
most clusters are small, with a median cluster 
size of 17 sources. The three largest groupings 
are located at the position of the three Hll re- 
gions W43, G29 and Vul OBI. They contain over 
100 sources each, which represents over half the 
clustered sources, and a third of all the sources 
in the SDP fields. The median effective radius 
is about 5.8 pc with a significant spread from 
0.8 to 21 pc. At the low end of this range, the 
size is typical of long-lived gravitationally bound 
clusters. However large groupings, with sizes over 
■~5-10 pc, are likely unbound structures associated 
with ongoing star forming activity too distant for 
Herschel to resolve individual substructures. The 
source surface density also has a significant spread 
in value, from ~1 to 4.6 sources. arcmin~ 2 , with 
a median value of 1.98 sources. arcmin -2 . Ta- 
ble [5] also gives the surface density in units of 
sources.parsec -2 , but this physical parameter is 
not quite relevant for studying the most distant 
clusters (up to ~12 kpc in our sample) due to 
completeness issues (we only measure the surface 
density of massive protostars) and spatial reso- 
lution limitations (cf section 14. 2p . Most clusters 
tend to be elongated, with a median aspect ratio 
of 1.5, which property could be inherited from the 
primor dial structure of their parental molec ular 



cloud ( Teixeira et all 12009 : lAllen et all 120071 ). In 



rare cases, the aspect ratio of ci rcular small clus 
ters can reach values below 1. IGutermuth et al 



(|2009h argue that these unphysical values arise 
from the increasing uncertainty of A ad j USted as the 
number of source members decreases and as the 
number of members that are convex hull vertices 
increases. 

The cluster identification and characterization 
methods, as described above, seem to be reliable 
in retrieving cluster morphological parameters. 
Nevertheless, we consider the possible systematic 
effects introduced in our analysis by the choice of 
the 10+ cluster size limit, and the cutoff branch 
length. For instance, if we allow the clusters to 
contain fewer sources, then many more smaller 
clusters are detected and the parameters distri- 
bution changes accordingly. The determination 
of morphological parameters for large clusters is 
however immune to a decrease in the cluster size 
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Table 2 

Cluster characteristics in Hi-GAL SDP fields. 



ID 


Glon a 


Glat a 


Number of 


Distance 






Aspect 


Source density 


Median 




[Degree] 


[Degree] 


sources 


[kpc] 


[Arcmin] 


[PC] 


ratio 


[arcmin ] 


[pc- 2 ] 


Branch [pc] 


I = 30° field 


1 


30.549 


0.0297 


12 


11.6 ± 0.19 


3.18 / 4.19 


10.8 / 14.1 


0.57 


4.59 


52.7 


3.72 


2 


30.826 


-0.134 


12 


11.2 ± 0.07 


3.30 / 3.03 


10.7 / 9.89 


1.18 


2.41 


25.6 


2.78 


3 


30.959 


0.5884 


12 


12.9 ± 0.05 


2.44 / 2.20 


9.20 / 8.30 


1.22 


1.27 


18.0 


2.70 


1 


29.158 


0.0250 


19 


8.97 ± 0.07 


5.83 / 3.69 


15.2 / 9.65 


2.48 


2.26 


15.4 


3.12 


5 


29.932 


-0.038 


129 


8.55 ± 0.22 


7.90 / 6.89 


19.6 / 17.1 


1.31 


1.15 


7.16 


1.74 


6 


30.029 


0.1020 


11 


8.13 ± 0.47 


2.59 / 1.98 


6.15 / 4.68 


1.72 


1.12 


6.27 


2.23 


7 


30.291 


-0.219 


39 


8.04 ± 0.14 


6.15 / 4.96 


14.4 / 11.6 


1.53 


1.98 


10.8 


2.04 


8 


30.426 


-0.223 


13 


7.96 ± 0.09 


3.07 / 2.51 


7.13 / 5.81 


1.50 


1.52 


8.18 


2.01 


9 


30.855 


-0.105 


22 


7.65 ± 0.79 


3.67 / 2.60 


8.17 / 5.80 


1.97 


0.97 


4.81 


1.49 


10 


30.733 


-0.021 


344 


5.65 ± 0.37 


18.3 / 12.5 


30.1 / 20.6 


2.13 


1.43 


3.89 


1.16 


11 


30.692 


-0.269 


16 


5.52 ± 0.10 


2.57 / 2.66 


4.13 / 4.27 


0.93 


1.39 


3.59 


1.14 


12 


30.489 


-0.358 


20 


0.89 ± 0.02 


4.09 / 3.33 


1.06 / 0.86 


1.50 


1.74 


0.11 


0.25 


I = 59° field 


13 


58.734 


0.6374 


10 


5.48 ± 0.16 


10.1 / 5.54 


16.1 / 8.83 


3.34 


2.41 


6.13 


1.76 


11 


58.512 


0.3404 


16 


2.63 ± 0.14 


5.74 / 3.25 


4.39 / 2.49 


3.11 


2.08 


1.21 


0.90 


15 


58.998 


-0.239 


16 


2.30 ± 0.00 


3.64 / 2.83 


2.43 / 1.89 


1.65 


1.57 


0.70 


0.67 


16 


59.189 


-0.332 


23 


2.30 ± 0.00 


4.25 / 3.86 


2.84 / 2.58 


1.20 


2.04 


0.91 


0.57 


17 


59.490 


-0.215 


132 


2.30 ± 0.00 


12.8 / 9.33 


8.61 / 6.24 


1.90 


2.07 


0.92 


0.54 


18 


59.422 


0.0145 


12 


2.30 ± 0.00 


3.09 / 2.79 


2.06 / 1.86 


1.22 


2.04 


0.91 


0.53 


19 


59.128 


-0.117 


15 


2.30 ± 0.00 


2.87 / 3.27 


1.92 / 2.19 


0.77 


2.24 


1.00 


0.65 


20 


59.811 


0.0528 


56 


2.30 ± 0.00 


8.81 / 6.07 


5.89 / 4.06 


2.10 


2.06 


0.92 


0.59 


21 


59.787 


0.2275 


17 


2.30 ± 0.00 


2.82 / 2.30 


1.89 / 1.54 


1.50 


0.98 


0.43 


0.52 



a Clustcr coordinates, expressed in Galactic longitude and latitude, represent the median coordinates of individual cluster members. 
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limit (see iBastian et al.l (|2007t) for a deeper anal- 
ysis). The limit of 10 members was actually cho- 
sen to be consistent with previous studies so as 
to make results comparable. And indeed they 
are: we find that 70% of Hi-GAL sources are 
identified as cluster members, which matches the 
clustered fractions of 60-80% found for Spitzer- 
identified YS Os in various Galactic star forming 
regions (e.g. lAllen et alJ l2007tlKoenig et al.Eooa 



Gutermuth et alj|2009) . 



The determination of the cutoff branch length 
is also somewhat arbitrary as it does not ex- 
ploit any physical properties of the sources or 
immediate environment; kinematic measurements 
are the only way to establish unambiguous clus- 
ter membership. But one has to compromise to 
study clustering properties over very large sam- 
ples (~T0 5 sources are expected to be detected 
in the Hi-GAL survey once completed) and this 
translates into the arbitrary choice of a threshold, 
be it a cutoff source surfa ce density (jLada fc Lada 



l2003tlSchmeia et al 
dSchmei 



Klessen 



20081) o r a MST branch lengt h 
Koenig et all l2008h . 



2006 



Gutermuth et al.l ( 20091 ) argue that defining such 



a threshold from MSTs in a systematic manner 
for each field independently is a significant advan- 
tage over the more pragmatic definition of fixing 
a threshold surface density value that can poten- 
tially vary from region to region and thus miss 
or misidentify clusters. We therefore opted for 
Gutermuth et al. (2009) approach to define the 
cutoff MST branch length (cf section 13. 3[) , keep- 



ing in mind that this quantity remains somehow 
arbitrary. 

4.2. Cloud Fragmentation 

If we consider the idealized scenario of Jeans 
fragmentation in a uniform isothermal molecular 
cloud, gravitational instabilities can lead to the 
collapse and subsequent fragmentation of spher- 
ical cores when the local gas pressure can no 
longer support the gravitational pull of the en- 
closed mass. The hydrostatic equilibrium crite- 
rion can be expressed as the Jeans mass Mj, or 
equivalcntly as the Jeans length: 



'15/csT 
AnGpp 



(1) 



where ks is the Boltzmann constant, G is the grav- 
itational constant, T and p are the temperature 
and density of the cloud, and p is the average mass 
per particle in the cloud. 

Assuming a typical temperature of 20 K and 
density of 10 5 cm -3 for a molecular cloud, we find 
A j ~ 0.1 pc. This quantity represents a charac- 
teristic core size which should leave its imprint on 
the source spatial distribution in the fragmented 
cloud. It is therefore to be compared with the 
typical source spacing found in protostellar clus- 
ters, before they migrate from their birth place 
and wash out their initial spatial distribution. 

Figure [5] shows the median MST branch lengths 
measured in the 21 clusters identified in sec- 
tion [3T4J as well as the Jeans lengths derived from 
equation (JXJ) for two sets of parameters brack- 
eting the typical physical conditions found in 
the dense and cold interstellar medium, (T, p) = 
(20 K, 10 5 cm" 3 ) and (30 K, 10 3 cm" 3 ). We find 
that, for most clusters, the median MST branch 
length is larger than the Jeans lengths. To in- 
terpret this result, we compare the median MST 
branch lengths, function of the cluster helio- 
centric distance, with the angular resolution of the 
SPIRE and PACS instruments (see Figure [6]), and 
it turns out that the typical source spacing is sys- 
tematically larger than the SPIRE 500 beam size. 
This is an indication that the Herschel telescope 
cannot resolve spacings shorter than the Jeans 
length at kiloparsec distances in the dense inter- 
stellar medium. Completeness might also come as 
a limitation since we are mostly sensitive to bright 
and massive protostars, leaving the intermingled 
low-mass protostars undetected, which introduces 
a bias toward larger source spacings. 

4.3. Chromatic Spatial Distribution 

From molecular condensations, to pre-stellar 
cores, envelop-, and then disk-accreting proto- 
stars, the peak emission from young stellar ob- 
jects shifts from the millimeter to the mid-infrared 
regim e as they evolve (e.g 



Andre et al.l 11993 , 



2000h . The Hi-GAL survey spans over one order 
of magnitude in wavelengths, covering most of the 
above wavelength range. We therefore expect the 
5-band observations to reveal young stellar objects 
in various stages of evolution, and possibly evi- 
dence different clustering properties for different 
evolutionary stages. 
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Fig. 5. — Cluster core statistics derived from the 
MST analysis. The vertical dot-dashed lines indi- 
cate the median values of the considered quanti- 
ties. Details of the computation are given in the 
section I4.ll 



2 r 



We compute the source surface density from 
single- wavelength source catalogs to look for vari- 
ations in the source spatial distribution as a func- 
tion of wavelength. In practice, we use a grid of 
12"-pixels, and the value assigned to each pixel 
is the number of sources that fall within 2' of 
the pixel center. Then we convolve the result- 
ing array with a 2D gaussian (2' FWHM) to ob- 
tain a 2'-resolution smooth map of the source 
surface density. Figure [7] shows the surface den- 
sity maps derived in all 5 bands for the two SDP 
fields. This figure illustrates the conspicuous ten- 
dency for the source spatial distribution to evolve 
smoothly with wavelength - in a similar way in the 
two SDP fields - from fairly small compact clusters 
at short wavelengths to looser and larger clusters 
at longer wavelengths. The peak surface density 
is 3 to 4 times higher in 70 jum clusters than in 
500 /urn clusters. Short-wavelength clusters seem 
to be preferentially located around Hn regions, 
and most clusters appear to be coincidental across 
the wavelengths, which is consistent with the clus- 
ters being associated with the same complex, i.e. 
the same parent molecular cloud. We carried out a 
similar MST analysis on single-wavelength source 
catalogs, and we found that the cutoff branch 
length increases monotonically with wavelength, 
going from 85" at 70 /im to 180-200" at 500 fjm 
in the two SDP fields. The cutoff branch length 
determination is very sensitive to the CDF steep- 
ness at the lower end of the branch length scale (cf 
section [3~3f . so that it can be seen as an estima- 
tor of cluster compactness. The increasing cutoff 
branch length with wavelength therefore confirms 
the evolution of clustering properties observed in 
Figure [7J 



PACS 70 ong.^:/!? - 1 ^! -- 



t£l 



2 4 6 8 10 12 14 
Distonce [kpc] 



Fig. 6. — Median MST branch lengths (open di- 
amonds), derived per cluster, as a function of the 
cluster distance. The top and bottom dot-dash 
lines show the Jeans lengths for two sets of cloud 
physical conditions, (30 K, 10 3 cm~ 3 ) and (20 K, 
10 5 cm -3 ), respectively. The two slanted dashed 
lines represent the angular resolving power of Her- 
schel in the SPIRE 500 and PACS 70 bands. 



The evidence of a smooth chromatic evolution 
of Hi-GAL sources spatial distribution is a re- 
markable result, but it needs to be tested against 
possible observational biases that could account 
for the proposed trend. We first rejected the vary- 
ing angular resolution of the Herschel telescope 
between 70 and 500 /im as a possible cause for 
the observed density maps by convolving Hi-GAL 
maps in each band with the SPIRE 500 beam, and 
by detecting compact sources from these images 
and computing source surface density maps as in 
Figure [7] It appears that the smooth evolution of 
the source spatial distribution as a function of the 
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wavelength remains, which confirms that the Hcr- 
schcl angular resolution across PACS and SPIRE 
bands cannot account for the observed results. We 
also considered extragalactic contamination as a 
potential bias in our analysis, mostly as a popu- 
lation of homogeneously distributed background 
sources at SPIRE wavelengths (250-500 fxm). We 
ruled out this option on the ground that (1) long- 
wavelength source clusters are spatially correlated 
with the molecular gas distribution derived from 
13 C O integrated maps fro m the Galactic Ring Sur- 



vey ( Jackson et al. 20061 ). and (2) the low extra- 



galactic number counts from the H- ATLAS survey 
(less than 1 galaxy per squar e degree is expected to 



be brighter than 800 mJy IClements et al.ll2010h 
combined with the high confusion noise (>1 Jy) 
arising from cirrus clouds emission in the Galactic 
plane (|Martin et al.ll2010h makes it very unlikely 
to detect more than a few extragalactic sources in 
Hi-GAL fields. Additionally, we explored the im- 
pact of the relative sensitivity in Herschel bands 
that might also play a role in the observed spa- 
tial distribution. We have computed source den- 
sity maps similar to those in Figure [7] for various 
subsets of single-band source catalogs by keeping 
only the brightest objects (flux limits were the 
first, second and third quartiles of the source flux 
distribution). It appears that the faintest sources 
are mainly isolated objects while the brightest 
sources, independently of the wavelength or field, 



are concentrated in the densest re gions 



consistent with the conclusions of Kirk 



This is 
Myers! 



( 2010l ) that the most massive stars, and presum- 
ably the brightest objects, are generally located 
near the center of local source overdensities. How- 
ever, the measured flux at a given wavelength (and 
heliocentric distance) is not a direct tracer of mass, 
but can also change with the YSO evolutionary 
stage. Additionally, the higher concentration of 
bright objects in the densest regions could par- 
tially be due to the larger number of individual 
objects blending into the telescope beam making 
the central objects appear brighter. Flux-limited 
source density maps are therefore difficult to in- 
terpret. 



If the observed evolution of the spatial distribu- 
tion with wavelength is genuine, then it would im- 
ply the existence of two distinct populations of ob- 
jects. In particular, we argue that sources detected 



at SPIRE wavelength due to cold dust emission 
might be a mixture of stable and transient den- 
sity enhancements in the ISM, which appear to ex- 
hibit moderate clustering in relatively loose associ- 
ations, typically along cold fi l amentary structures 



(e.g iMolinari et~aHl2010at iMen'shchikov et al 



2010), whereas more evolved objects such as pro- 
tostars, which emit most of their energy in PACS 
bands due to the presence of accreting warm cir- 
cumstcllar material, are grouped in smaller and 
more compact clusters around Hn regions. We 
could further speculate that the feedback from 
massive stars on their immediate surroundings 
might have induced the collapse of neighboring 
cores, causing protostellar clusters to grow more 
efficiently around these Hn regions. These consid- 
erations are consistent with scenarios of triggered 
star formation. However we would need a reliable 
identification of the physical properties and evo- 
lutionary stages of the Hi-GAL sources, based on 
their SED fitting, to propose a firmer interpreta- 
tion of our results. Such information should be 
available for later analysis of the entire Hi-GAL 
data set. 

4.4. Infrared Dark Clouds 

The earliest phases of star formation seem 
to occur preferentially in cold (T<20 K) and 
dense (n(H2)> 10 4 cm" 3 ) filamentary struc- 
tures generated by gravity, turbulent mechanisms 
and m agnetic fields in the interstellar medium 



(e.g. lAndre et al] l2010t iHenning et ail l2010l ) 



These structures are ubiquitous in the interstel- 
lar medium. In some cases, filaments are seen 
in absorption against the bright background in 
the Galactic plane, even at mid-infrared wave- 



lengths, where the e xtinction is fairly low (jLutz 
19991 : lFlahertvl[2007h . in which case they trace 
the densest structures. Following this defin- 
ing property, these objects are named Infrare d 
Dark Clouds (IRDC, e .g. ICarev et all Il998l) . 
Peretto fc Fuller] (j2009l ) used Sp itzer near- and 



mid-i nfrared surveys, GLIM PSE (B enjamin et al 



20031 ) and MIPS GAL (ICarev et al.ll2009h . to ob" 



tain a census of IRDCs in the Galactic plane. 
We exploit the iPeretto &: Fullerl IRDC catalog to 
look for associations between Hi-GAL sources and 
these dark filaments of cold matter. We derive 
the fraction of Hi-GAL sources that fall into two- 
dimensional ellipses generated from the morpho- 
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Fig. 7. — Source density maps derived at all 5 Herschel bands (from left to right: detections at 70, 160, 
250, 350 and 500 fxm) for the 2 Hi- GAL SDP fields at I = 30° (top row) and I = 59° (bottom row). Short- 
wavelength sources are grouped in dense compact clusters while long- wavelength sources are distributed in 
looser and larger groups. 



logical parameters given in the IRDC catalog (RA, 
DEC, major and minor axis, and position angle). 
We find that 32% and 19% of the sources in the 
I = 30° and I = 59° fields, respectively, are coinci- 
dent with IRDCs. Among those sources, 93% and 
98%, respectively, are classified as cluster mem- 
bers by the MST analysis. In contrast, only 6% of 
the sources from the control distribution arc co- 
incident with IRDCs. This indicates that IRDCs 
are indeed associated with far-infrared sources, 
and thus to the earliest phases of star formation, 
especially with clustered sources, but it appears 
that a significant fraction of sources have no con- 
nections with IRDCs. The main reason for this 
moderate source/IRDC association rate is likely 
due to the fact that IRDCs do not trace systemat- 
ically dense filaments in the interstellar medium, 
but only those that contrast well with bright in- 
frared backgrounds. A better approach would be 
to rely on the sub-millimeter emission of these 
dense cold filaments rather than on their extinc- 
tion properties. This would provide a more com- 
plete determination of dense filamentary struc- 
tures, inde pendently of the ba c kgrou nd surface 
brightness. Men'shchikov et al. ( 2010l) present a 
qualitative examination of the associations of fil- 
amentary structures and compact objects in the 
Aquila and Polaris clouds, and they find that most 
sources lie within filaments. 



4.5. HII Regions 

We want to quantify the observed grouping of 
Hi-GAL sources around Hn regions as mentioned 
in section |4~T] We use the catalog of Hn regions 



compiled by iPaladini et al. (|2003h to derive the 



distance from each Hi-GAL source to the cen- 
ter of the closest Hn region. The / = 30° field 
contains 25 Hn regions with radii ranging from 
1.6' to 13' and a median value of 4.3', while the 
I = 59° field contains only 2 Hn regions. We will 
therefore focus our analysis on the I = 30° field 
due to the higher statistical significance we can 
reach in this field. 

Figure [8] shows the distance histograms when 
considering all the sources observed in the I = 
30° field, as well as the random control distri- 
bution for comparison. About 30% of the ob- 
served sources are located within a Hn region, 
and the rest of the sources appear to be dis- 
tributed closer to the Hn regions than randomly 
distributed sources. In fact, over a third of the 
observed sources are clos^l to the Hn region bor- 
der. Yet this occurs for only 17% of the ran- 
domly distributed sources. Such a YSO den- 
sity enhancement is consistent with the collect 
and collapse scenario of triggered star formation 



2 The border is loosely defined as an annulus of inner and 
outer radii of 0.5 and 1.5 times the actual radius of the 
presumably associated Hn region. 
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(e.g., IWhitworth et al.lll994t[Zavagno et alj|2006h 
in which a layer of gas and dust is compressed be- 
tween the ionization and shock fronts produced 
by an expanding Hn region. Nevertheless, 55% of 
the observed sources fall outside a circle 1.5 times 
larger than the Hn region radius, which means 
that a significant fraction of the Hi-GAL sources 
might not be associated with any Hn regions. 

We further investigate the chromatic spatial 
distribution mentioned in section 14.31 with respect 
to Hn regions. We select 70 /mi and 500 /urn 
sources from single-wavelength source catalogs, 
and we repeat the above analysis for the I = 
30° field (cf Figure©. We find that 36% of 70 fim 
sources are observed within an Hn region, with 
a median distance to the center of the region of 
~5' (similar to the median radius of Hn regions), 
against only 19% for 500 fj,m sources, with a me- 
dian of ~7'. This confirms that short- wavelength 
Hi-GAL sources are preferentially located within 
or around Hn regions compared to their longer 
wavelength counterparts.. 

5. Summary and future work 



We have characterized the spatial distribution 
of far-infrared sources in the 2 Hi-GAL SDP fields. 
We first derived reciprocal distance matrices, then 
we built Minimum Spanning Trees including the 
heliocentric distance estimates necessary to sep- 
arate clusters al ong the line of sight. F ollowing 
the formalism of Gutermuth et al. (2009) we have 
identified and characterized 21 clusters across the 
2 fields. The 3 largest associations have more than 
100 members and contain over a third of all the de- 
tected sources. The clusters are mildly elongated, 
with radii in the range 1 to 20 pc, and a median 
density of 2 sources. arcmin" 2 . Half of the clusters 
are likely associated with Hn regions, and most 
IRDCs in the SDP fields are associated with clus- 
tered sources. However we were unable to evidence 
the imprint of fragmentation in molecular clouds 
further than ~1 kpc due to the limited angular res- 
olution and sensitivity of the survey. Nevertheless 
our analysis revealed the existence of 2 popula- 
tions of YSOs with distinct clustering properties: 
short-wavelength sources tend to be clustered in 
dense and compact groups while long-wavelength 
sources are clustered in looser and larger groups, 
with a somewhat continuous evolution between 
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these two clustering regimes as the wavelength 
increases. This remarkable result is based solely 
on monochromatic source density maps and mini- 
mum spanning trees, and any interpretation would 
be speculative at this point. We rather need to 
derive the physical properties of Hi-GAL sources, 
characterize the spatial distribution of the differ- 
ent classes of YSOs according to their evolutionary 
stage for instance, and look for similar spatial seg- 
regation effects. However the task of fitting SEDs 
in such crowded fields is very difficult due to the 
larger beam at longer wavelengths. Our team is 
now working on improving the reliability of SED 
fitting in dense clusters. 

We have exploited only 3% of the Hi-GAL sur- 
vey in the present analysis. With the remaining 
260 square degrees to be covered by Hi-GAL, and 
better tools for characterizing the physical prop- 
erties of YSOs, we expect to harvest an unprece- 
dented wealth of information with high statisti- 
cal significance. We will study the cluster mass 
function, as opposed to the individual source mass 
function, and look for correlations between clus- 
tering properties and mass, evolutionary stage, en- 
vironment and the galactocentric distance. In ad- 
dition, the proposal for a Hi-GAL n survey has 
recently been accepted. It will cover ~ 270 square 
degrees of the outer Galactic plane (spread by 
60° of Galactic longitude on either side of the 
Galactic anticenter). This will increase the statis- 
tics and allow us to probe a more quiescent part of 
the Galaxy where the star forming regime might 
be different than the one observed with Hi-GAL. 

The authors would like to thank the entire Hi- 
GAL team, in particular the Distance Working 
Group for their colossal effort to obtain distance 
estimates for the majority of Hi-GAL sources. 

Facilities: Herschel Space Observatory (PACS 
& SPIRE Parallel mode). 
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